Model Selection

Chinese Multimodal

# Chinese Multimodal

Chinese Clip Vit Base Patch16

Chinese CLIP model based on ViT architecture, supporting multimodal understanding of images and text

Mengzi Oscar Base Caption

A Chinese multimodal image captioning model fine-tuned on the AIC-ICC Chinese image caption dataset, based on the Mengzi-Oscar pretrained model

Transformers Chinese

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase